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ABSTRACT 


The classification technique and data forecasting will probably be one of the 
techniques that will often be needed in handling or managing big data. 
So, from that the author analyzes the possible development of the existing 
algorithms. The purpose is to find possibilities in the use of reliable 
algorithms in a particular field, then can be adopted and implemented to 
develop forecasting techniques in the future. Based on these considerations, 
the authors conducted experiments by applying LVQNN to conduct 
shortterm forecasting on daily period of the Rupiah exchange rate. 
The literature that is used as a reference is the discovery of architectural data 
classification processes that resemble forecasting techniques. So, when there 
is a combination of Rupiah exchange histories, it is possible to find these 
combinations into certain classes based on predetermined parameters and 
historical data combination data and forecast values in the past. In this 


research the factors chosen as indicators that affect the Rupiah exchange rate 
are the amount of exports, the amount of imports, the inflation rate and also 
the world oil price. In this research the highest accuracy value in the testing 
process for the population reached 99.0991%. The increase in the percentage 
value of forecasting accuracy is influenced by the composition of the data. 
In this study the formation of data composition is influenced by distinct data. 
The selection of parameters which become distinct claused determines how 
the composition of the data will be formed. If the composition of the data is 
not correct, the test results will not be good. If the number of weights vector 
is smaller than the input data, the forecasting accuracy will decrease. Because 
the weight vector cannot represent data combinations that used during 
training or testing. 
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1, INTRODUCTION 

Currency exchange rate plays an important role in financial markets. Exchange rates are determined 
in the foreign exchange market. Exchange rate stability is one important consideration for investors to take 
investment policies. If the exchange rate has high fluctuations, this can affect interest rates, commodity prices 
and a country's economic policy [1-4]. Then, if it is continued for a long time, it will affect investment policy. 
Thus, some application of forecasting techniques can help to read the direction of the movement of currency 
exchange rates, so that it can help investors in reading investment opportunities based on a country's 
exchange rate factor. Therefore, the risk of failure can be minimized and profit opportunities can be 
optimized. 
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There are several methods that can be used in analyzing and forecasting [5-15], which are classified 
into statistical techniques and artificial neural networks. Each of these categories has a lot of techniques and 
algorithms that can be chosen according to the purpose of data processing. This research has used LVQNN as 
an artificial neural network algorithm that has been selected to forecast the value of the Rupiah exchange rate 
against the US Dollar. The reason is because LVQNN has been proven to have reliability in data processing 
[16, 17]. 

In general, LVQN is widely used to classify digital images. Evidenced by various studies that have 
been done, LVQNN has advantages when compared to other algorithms. So, there are some studies that rely 
on LVQNN as an optimization algorithm for computing process of other algorithms [17-19]. However, 
LVQNN is not perfect, sometimes in certain cases LVQNN must be combined with other algorithms to 
achieve its best performance [16, 18-23]. 

In this study LVQNN will be applied to process data that is different from its common use. Previous 
studies the LVQNN is used to classify digital images. In this research, LVQNN is used to forecast by 
utilizing a numeric type of data. The similarity of LVQNN architecture with the methods commonly used in 
forecasting is the process of comparing data. The data compared is the current symptom data to the control 
class which is a documentation of past symptoms. Then this research will evaluate the use of LVQNN 
algorithm in processing numerical data to predict currency exchange rate, especially Indonesian Rupiah. 
Finally, the effect of data composition and data weight will be analysed. 


2. RESEARCH METHOD 

The research method that has been used in this study is the quasi-experimental method. This method 
is chosen because it is in accordance with the research objectives and the character of the data. The purposive 
sampling technique used has also indicated that the requirements for randomizing data for the true 
experimental method cannot be fulfilled, so the most appropriate methodology to choose is the quasi- 
experimental model. 

In this research some experiments conducted to prove that LVQNN which generally used to image 
classification can also be used to forecast. The reason for choosing LVQNN is with consideration of previous 
studies that have proven the reliability of LVQNN in classifying digital images [18, 19, 22, 23]. Besides that, 
based on its architecture, LVQNN is suitable to be implemented as one of forecast methods. 

The data which used in this study is a secondary data. The amount of data that has been used in this 
study as a population is 1671 rows of data. This data has been obtained from the official website of the 
Republic of Indonesia Trade Ministry [24]. The data consist of the exchange rate of the Indonesian Rupiah 
against the US Dollar, the amount of exports, the amount of imports, the inflation rates, and the world oil 
prices (ORB Price). In addition, this data has been included as an Indonesian economic indicator [24]. 

There were several experimental models that carried out in this research. The first experimental 
model applied as the controller part in the quasi-experimental model. In this model the training and testing 
process of LVQNN are carried out with pure training data and pure testing data. Then, in the next 
experiment, data treatment has been done by adding weight data to the training data and testing data. 
The purpose is to add the data variant. Another treatment in this research is to arrange the selection of 
parameters which is used to perform distinct data. Therefore the composition of the weighting data, training 
data and testing data are changed according to the chosen parameter to be distinct parametric. The treatments 
are part of the quasi-experimental model. So that we can obtain comparison measures from experiments that 
have been carried out on the control elements in the study. 

Table 1 shows an example of the data used in the study. In this table, there are five parameters that 
will affect forecast result based on the input symptom data. Before the data in Table 1 is processed in the 
LVQNN Algorithm, this data will be filtered using the distinct technique. Distinct in this research is the 
process of grouping data using certain parameters. This process is used to get different values from the data. 
These different values will later be used as weighting data, while the rest will be as input data. This process 
can be done separately by using SQL queries, but in this study this distinct process is used as part of a 
computing process that runs sequentially. 

This process will produce two new tables, namely Table 3 which contains the weight data and 
Table 2 which acts as input data. The purpose of distinct data is to get groups of data that can be used as 
weighting data, and separate them from groups of data that act as input data. 
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Table 1. Indonesian Economic Indicator Data and ORB Price for Rupiah Exchange Rate Forecast period 


IDR Exchange Amount of Export Amount of Inflation ORB Price IDR Exchange Rate 
Rate Import Prediction 
IDR/1USD Juta US$ Juta US$ - US$ IDR/1USD 
8924 12181.6 9654.1 0.44 74.04 8924 
8924 12181.6 9654.1 0.44 75.54 8924 
8924 12181.6 9654.1 0.44 TS 8924 
8924 12181.6 9654.1 0.44 75.49 8924 
8928 14399.6 12120 0.06 81.07 8928 
9802 16133.4 14770.3 -0).03 102.75 9802 
9802 16133.4 14770.3 -0).03 102.11 9802 
Table 2. Example of Input Data as the Result from Distinct 
ala a Amount of Export rie i Inflation ORB Price oa sae 
IDR/1USD Juta US$ Juta US$ - US$ IDR/1USD 
8924 12181.6 9654.1 0.44 74.04 8924 
8924 12181.6 9654.1 0.44 75.54 8924 
8924 12181.6 9654.1 0.44 75.37 8924 
9802 16133.4 14770.3 -0).03 102.11 9802 
Table 3. Example of Weight Data as the Result from Distinct 
Hoe Eeehange Amount of Export AMOUDE OF Inflation ORB Price a Pxcnatie eat 
Rate Import Prediction 
IDR/1USD Juta US$ Juta US$ - US$ IDR/1USD 
8924 12181.6 9654.1 0.44 75.49 8924 
8928 14399.6 12120 0.06 81.07 8928 
9802 16133.4 14770.3 -0.03 102.75 9802 


The computing process will take place based on the LVQNN computing architecture. This process 
begins by comparing each input data to each weight data. In the LVQNN model this process 1s termed the 
calculation of the Euclidean distance value. This Euclidean distance value will determine which weight will 
be updated and how the weight data will be updated. 

There are six experimental models that are tested in this study. The first model is an experiment that 
used Rupiah exchange rate as distinct parameters. This model also applies pure training data and pure testing 
data. The second model adds weight data into the input vector in the training process. The third model adds 
weight data into the input vector in the training process and also in the testing process. However, in the fourth 
model until the sixth model utilizes ORB price as distinct parameters. In the fourth models only use pure 
training data and pure test data the same as the first experiment. The fifth model adds weight data into the 
input data at training process. Finally, in the last experiment the weight data is added into the input data not 
only in the training process, but also in the testing process. 


3. RESULTS AND ANALYSIS 

The results of the first experiment can be seen in Figure |. If we look carefully, it can be seen that 
between the actual and ideal values there is a relatively high deviation. Since from the point of 793 of the 
data trained only a few data that seemed to have actual values that were equal to their ideal values. This can 
be seen in the circled graph section, where in this section there is a graph that intersects the actual value and 
ideal value. 

In order to improve computational accuracy, the second experiment was then carried out by mixing 
weight data with training data. The aim is to add to the training data variant and see its effect on the results of 
training and testing. Unfortunately, there is no significant change in deviation value. As shown in Figure 2, 
changes in graphic form do occur at some point, but not many. 

In the next experiment a weighting of data on training data and testing data were combined. 
The purpose is to get better results from previous models. The graph of the comparison of deviation values 
does change, as shown in Figure 3. 
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Figure 1. Deviation values of the first experimental model with rupiah exchange rate as parameters of distinct 
on population 
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Figure 2. Deviation values of the second experimental model with rupiah exchange rate as parameters of 
distinct on population 
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Figure 3. Deviation Values of the Third Experimental Model with Rupiah Exchange Rate as Parameters of 
Distinct on Population 


In the fourth model, the ORB price parameter was chosen as the distinct parameter, resulting in a 
graph of the comparison of deviation values that was far better than the previous experimental models. 
The graph of actual and ideal values has aligned values as shown in Figure 4. There are some points that do 
not intersect on the graph, it shows there are differences in actual and ideal values. However, the amount of 
deviation is not high. The value is still relatively low and in line with the actual value graph. 
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The results of accuracy on the fourth model have shown significant accuracy improvements. 
However, to get a constant value, the same treatment is given to the next model. Therefore, in the fifth 
experimental model, the addition of weight data to the training data was applied—The results of this model 
can be seen in Figure 5. The figure showed the comparison chart of the actual and ideal values. The graph 
which shown in Figure 5 has a shape that similar to Figure 4. However, if we see in detail, there are 
differences between the actual and ideal values at some points in the graphs. 

In the last experiment, we added weight data to the testing data. The comparison chart of actual and 
ideal values displays a graph that is almost perfect. There are several differences in actual values and ideal 
values on some points, but the value is relatively small, as shown in Figure 6. When compared with the 
values of the previous model, there appears to be an increase in the quality of computing. Not only does the 
deviation value decrease, there is also an increase in the percentage of accuracy. Similar to the value 
movement in the previous experimental model, in this model an increase in value occurs gradually as a result 
of the solutions offered in each experimental model with the same treatment. 
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Figure 4. Deviation values of the fourth Figure 5. Deviation values of the fifth experimental 
experimental model with orb price as parameters of model with orb price as parameters of 
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Figure 6. Deviation values of the sixth experimental model with orb price as parameters of 
distinct on population 


Through the data which presented in Table 4, it can be seen the lowest percentage accuracy ratio and 
the highest of the testing process on population data. The lowest percentage accuracy value is owned by the 
first experiment model, which is a model with the Rupiah exchange rate as the distinct parameter. This model 
produces an accuracy percentage value of 6.053% that used pure training data and pure testing data. 
While the experimental model which has the highest percentage accuracy value 1s owned by the sixth model. 
This model used ORB price as the distinct parameter, which produces an accuracy percentage of 99.09917%. 
This accuracy value is obtained by adding weight data into the training and testing process, to enrich the 
variety of data. 
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As shown in Table 4, the results of testing of several sample models also show a similar pattern. 
The sampling method used in this study was the purposive sampling method. Population data are divided into 
three parts to get a sample with a small amount of data, a sample with a medium number of data, and a 
sample with the same amount of data as the population. The purpose of this data sampling is to make 
observations on the value of the deviation and the level of accuracy of forecasting on different amounts of 
data. The results show a pattern that is relatively the same as the results of testing of the population. 
The amount of data used as training data or test data seems to affect the results of forecasting. In forecasting 
with pure test data displayed in the First Experiment Model, the forecasting accuracy value is very small. 
However, by selecting the appropriate distinct parameters, this increases accuracy or can be relatively 
controlled as in the Fourth Experiment Model. 


Table 4. RMSD Value and Accuracy Percentage in the Testing Process 


Amount of data 


ae Amount of Amount of Amount of aes eee 
Weight Data Training Data Testing Data 

S Model 1 28 265 264 102813.284 16.604 
z @ Model 2 28 293 264 70101.583 16.724 
= A Model 3 28 293 292 70218.065 20.819 
2 t~ Model 4 501 28 28 4036.036 92.857 
oo 2 Model 5 501 529 28 3948 .533 94.915 
= Model 6 501 529 529 213.628 99.622 
S Model 1 58 528 528 1758100.136 13.826 
& @ Model 2 58 586 528 2437133.366 11.092 
= Q Model 3 58 586 586 2412185.043 12.116 
2 = Model 4 950 82 82 11027.463 90.244 
coo = Model 5 950 1032 82 10241.389 97.171 
N Model 6 950 1032 1032 876.213 99.225 

Model 1 86 793 792 1377391.420 6.053 
5 g Model 2 86 879 792 1212421.746 10.580 
3A Model 3 86 879 878 1207483.005 13.538 
=~ Model 4 1437 A, ey 7915.154 90.598 
<x = Model 5 1437 1554 117 7620.529 94.101 

Model 6 1437 1554 1554 798.687 99.099 


Comparison of the average forecasting accuracy based on the selection of parameters for distinct 
data also shows a significant difference. The average value of the percentage of computational accuracy 
tested in the sample based on the "Rupiah Exchange Rate" as the distinct parameter is 13.484%. While the 
average value of the percentage of computational accuracy based on the "ORB Price" which chosen as the 
distinct parameter 1s 95.315%. Through this comparison, it is clear that the "ORB Price" parameter results in 
a better level of forecasting accuracy compared to the "Predicted Rupiah Exchange" as the distinct parameter. 

So, based on the facts produced through observations made during the experiment, it explains that 
the composition of the data greatly influences the results of forecasting. Mixing weight data on training data 
and test data can be a significant solution if the amount of weight data is greater than training data or test 
data. If the number of data weights turns out to be smaller than the training data and test data, then the 
percentage of accuracy will not increase significantly. The addition of weight data for training is related to 
the learning process. Then the results of testing after the process have also been improved. It is different from 
the addition of weight data for testing, which only ensure that the values in the learning process are also 
represented in the testing process. 

One component in this study that has a major role in improving computational quality 1s distinct 
data. The selection of parameters for distinct clauses of data turns out to influence the results of forecasting, 
through changing the composition of the weighting data, training data and test data. Incorrect selection of 
parameters will reduce the percentage of forecasting accuracy. Distinct data can be used as one of the 
considerations to get the best composition of data on forecasting techniques with similar architectures. 
Distinct data are not only focused on the filtering process to ensure there is no data redundancy, but also, 
to provide a data composition to produce better accuracy values through a selection of filter parameters. 
Each combination is unique and affects computing results. 

However, if the combination of values, sacrifices the composition of the weighted data on the 
training data and test data, the accuracy value will also decrease. If the amount of data weight is smaller than 
the input data, the forecasting accuracy will decrease, because the weighting data cannot represent data 
combinations used during training or testing. Conversely, if the amount of data weight is much greater than 
the input data, the forecasting accuracy will also decrease. This condition will occur if the weight data 
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combinations are not represented in the training and testing data. This phenomenon 1s indirectly similar to the 
learning process, where the balance of teaching material (weight data) given and also the composition of the 
test material (data input) determine the level of absorption of learning and test results. 


4. CONCLUSION 

Based on the facts obtained from the research process that has been carried out, it can be concluded 
that the LVQNN algorithm can be used for forecasting. From the results of experiments that have been 
conducted, it was found that the lowest percentage of accuracy of the testing process was 6.053%. Then, the 
highest percentage accuracy value is 99.0991%. Compared based on the selection of parameters for distinct 
data, the percentage of computational accuracy tested in the sample based on the "Rupiah Exchange Rate" 
parameter has an average value of accuracy of 13,484%. While the average value of the percentage of 
computational accuracy based on the "ORB Price" parameter chosen for distinct data is 95.315%. 

An increase in the percentage of accuracy is influenced by the composition of the data. In this study 
the formation of data composition is influenced by distinct data. The selection of parameters which become a 
distinct clause determines how the composition of the data will be formed. If the composition of the data is 
not correct, the test results will not be of good value. If the amount of data weight is smaller than the input 
data, the forecasting accuracy will decrease. Because weighting data cannot represent data combinations used 
during training or testing. When compared to the computational results of other methods used to forecast the 
exchange rate of a currency, the computational results of LVQNN in forecasting the Indonesian Rupiah 
exchange rate against the USD have a relatively good level of accuracy, reaching 99.0991% against 
population data (1671 lines of data). 
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